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Abstract 

This paper considers the cognitive interference channel (CIC) with two transmitters and two receivers, in which 
the cognitive transmitter non-causally knows the message and codeword of the primary transmitter. We first introduce 
a discrete memoryless more capable CIC, which is an extension to the more capable broadcast channel (BC). Using 
superposition coding, we propose an inner bound and an outer bound on its capacity region. The outer bound is also 
valid when the primary user is under strong interference. For the Gaussian CIC, this outer bound applies for |a| > 1, 
where a is the gain of interference link from secondary user to primary receiver. These capacity inner and outer 
bounds are then applied to the Gaussian cognitive Z-interference channel (GCZIC) where only the primary receiver 
suffers interference. Upon showing that jointly Gaussian input maximizes these bounds for the GCZIC, we evaluate 
the bounds for this channel. The new outer bound is strictly tighter than other outer bounds on the capacity of the 
GCZIC at strong interference (a^ > 1). Especially, the outer bound coincides with the inner bound for |a| > ^/l + Pi 
and thus, establishes the capacity of the GCZIC at this range. For such an a, superposition encoding at the cognitive 
transmitter and successive decoding at the primary receiver are capacity-achieving. 

I. Introduction 

The cognitive channel is a special case of an interference channel in which the second transmitter has complete 
and non-causal knowledge of the messages and codewords of the first transmitter This channel can be used to model 
an ideal operating scenario for cognitive radios, a device that can sense and adapt to the environment intelligently 
in coexistence with primary users. Fundamental limits of such a communication channel are of interest. Achievable 
rates of the cognitive channel was first obtained in [1] by merging Gel'fand-Pinsker coding pT] with the well-known 
Han-Kobayashi encoding |10| for the interference channel. At low interference, the capacity region of this channel 
in the Gaussian case has recently been established by Q and ||3) independently. While the former considers the 
Gaussian channel only, the latter studies the general discrete memoryless channel case, also called the interference 
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channel with degraded message set (IC-DMS). Cognitive channel capacity is also known for very strong interference, 
when both receivers can decode both messages |4|. At medium interference, the capacity is still an open problem, 
with some achievable rate regions presented in |j2), ||5), and ||6). 

The Z-interference channel (ZIC) is an interference channel in which only one receiver suffers from interference. 
Its capacity is also unknown even for the Gaussian channel, except for some special cases. From the capacity 
perspective, it is not important which transmitter interferes with the other in the ZIC. In a cognitive ZIC, however, 
due to asymmetric transmitters, two different ZIC are conceivable. One is with interference from the cognitive 
transmitter to the primary receiver, and the other from the primary transmitter to the cognitive receiver. While 



achievable rate regions for the first one have been studied recently in p3| , |16|, there has not been such an 
investigation for the second one. 

In this paper, we study the cognitive channel in general and apply the results to the Gaussian cognitive ZIC 
(GCZIC) in which the cognitive transmitter interferes with the primary receiver. The contribution can be summarized 
as follows. First, we introduce a new discrete memoryless cognitive interference channel (DM-CIC) in which the 
primary receiver is more capable than the secondary receiver We term it the more capable DM-CIC. Then, using 
superposition coding, we establish inner and outer bound on its capacity. We also define a strong interference 
condition and show that the proposed outer bound holds under this condition also. Implicitly, both inner and outer 
bounds are also valid for cognitive Z-interference channel that the interfered receiver is more capable than the other 
receiver 

Second, we show that at strong interference (a^ > 1), where a is the gain of interference link from secondary 
user to primary receiver, the outer bound is applicable to the Gaussian CIC, and thus to the GCZIC. Then we prove 
that in Gaussian noise channel, jointly Gaussian distribution is the optimum distribution for this outer bound; and 
therefore, we are able to compute this outer bound for the GCZIC. The outer bound is proven to be the best outer 
bound for the GCZIC at strong interference. 

Finally, we derive the Gaussian version of the achievable rate region and prove that when interference is highly 
strong, i.e., \a\ > \/l + Pi, the inner and outer bounds coincide. Thus, we establish the capacity region of the 
GCZIC at this range, and show that superposition coding is the capacity achieving scheme. For such a large a, 
superposition encoding at the cognitive transmitter and successive decoding at the primary receiver are capacity- 
achieving. 

The rest of paper is organized as follows. In Section [II] we discuss models for the Gaussian cognitive interference 
channel and the GCZIC as well as the existing capacity result for this channel at < 1. We also introduce the 



more capable DM-CIC in this section. In Section III we provide new inner and outer bounds on the capacity region 
of the DM-CIC. Then in Section IV we show that for > 1 we can apply the introduced inner and outer bounds 
to the GCZIC; and, we compute these bounds for this range. For \a\ > ^/l + Pi, we prove the outer bound is equal 
to the proposed achievable region; and thus, establish the capacity of the GCZIC. Section |V] concludes the paper. 
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Fig. 1. The DM-CIC with two private messages M\,M2. two inputs X\,X2, and two outputs Y\,Y2. When p(j/i , j/2 , 2:2) = 
p(y2\x2)p{yi\x-i,X2), the DM-CIC is converted to the DM-CZIC. 



II. Channel Models, definitions, and existing results 

The classical interference channel (IC) consists of two independent, non-cooperating pairs of transmitter and 
receiver, both communicating over the same channel and interfering each other A special case of the IC is the 
cognitive IC, also called an IC with degraded message sets (IC-DMS), in which a transmitter, the cognitive one, 
has non-causal knowledge of the messages and codewords to be transmitted by the other transmitter, the primary 
one. In this section we formally define this channel and some other derivative of that. 

A. Discrete memoryless more capable cognitive interference channel 

Consider the discrete memoryless cognitive interference channel (DM-CIC), also termed the discrete memoryless 
interference channel with degraded message sets (IC-DMS), depicted in Figure [T] where sender 1 wishes to transmit 
message Mi to receiver 1 and sender 2 wishes to transmit message A/2 to receiver 2. Message A/2 is available 
only at sender 2, while both senders know A/i. This channel is defined by a tuple (A*!, X2\p{yi, y2|a;i, a;2); 3^i, 3^i) 
where two inputs Xi,X2, and two outputs 3^i,3^i are related by a collection of conditional probability density 
functions p{yi,y2\xi, X2)- 

The discrete memoryless cognitive Z-interference channel (DM-CZIC) is a DM-CIC in which interference is one 
sided. More specifically, we consider the case where the primary user does not interfere the secondary one. This 
only affects the channel transition matrix. Thus, the DM-CZIC with two private messages A/i,A/2, for the two 
receivers, two inputs Xi,X2, and two outputs Yi,Y2 is a DM-CIC in which 

P{yi,y2\xi,X2) ^ p{y2\x2)p{yi\xi,X2) (1) 

for all p{xi,X2)- 

Definition 1. The DM-CIC is said to be more capable if 

I{Xi,X2;Yi)>I{XuX2;Y2) (2) 

for all p{xi,X2)- 
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Fig. 2. The Standard Gaussian cognitive cliannel with inputs Xi,X2, outputs Yi,Y2, and additive noises Zi,Z2. The dashed line shows 
the flow of non-causal information from the primary user to the cognitive user. 



Since the second transmitter can encode and broadcast both messages, in the absence of the first transmitter this 
channel reduces to the well-known more capable DM-BC. In the presence of first sender, this channel is no longer a 
BC but an interference channel (IC). However, due to cognition, the second transmitter has complete and non-causal 
knowledge of both messages and codewords; thus, it can act similarly to the BC's transmitter. This observation 
motivated us to define a condition similar to the one that makes one receiver more capable than the other one in a 
DM-BC. 

We also define another condition to identify that primary receiver is in a better situation than secondary receiver 
in receiving the signal of cognitive user. We name this strong cognitive inference condition, as it indicates, roughly 
speaking, the interference link from cognitive user to primary reciter is stronger the direct link of cognitive sender 
to its corresponding receiver. 

Definition 2. The DM-CIC is under strong cognitive interference if 

I{X2;Yi\Xi)>I{X2;Y2\Xi) (3) 

for all p{xi,X2)- 

Note that in general neither of these two definitions Q and Q implies the other one. 
B. Gaussian cognitive interference channels 



Without loss of generality, we use the standard form of the Gaussian interference channel |12|, p3) , in which 
the gains of both direct links are 1 and both noises are independent with unit variance. The standard Gaussian 
cognitive interference channel is shown in Figure |2] and is expressed as 

Yi^Xi+ bX2 + Zi 
Y2 = aXi +X2 + Z2 
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~ A/'(0, 1) 




Z2 ~ A/'(0, 1) 

Fig. 3. The Gaussian cognitive Z-interference channel (GZIC) 



Here the interference links are arbitrary constants a and h known at all the transmitters and receivers; Xi, X2 
represent the primary and secondary users' transmit signals, and Y\, Y2 their received signals; Zi, Z2 are independent 
additive noises Zi ^ A/'(0, 1) {i = 1,2). We also assume that transmitted signals are subject to average power 
constraint as E[Xl] < Pi and E[X^] < P2. 

Depending on the values of the interference links a and b, different classes of IC emerge. A special class is 
the Z-interference channel (ZIC) when either a = or 5 = 0. For a non-cognitive system, there is no difference 
in the capacity analysis of these two ZICs. In a cognitive system, however, due to asymmetric knowledge at the 
transmitters, two different cognitive ZICs are conceivable. One is when the primary receiver has no interference 
(a ~ 0), and the other is when the secondary receiver has no interference (6 = 0). These two GCZIC channels have 
completely different capacity regions. 

The capacity of the GCZIC with a = can be simply obtained from the well-known result of dirty paper coding 
by Costa ||8|. Achievable rate and capacity regions of this cognitive ZIC for the discrete memoryless case can also 



be found in 1 15|. On the other hand, to the best of our knowledge, not much work has been done on the second 
GCZIC (with b = 0). In this paper, we investigate the capacity region for this GCZIC with & = 0. In the rest of 
this paper, GCZIC refers to this channel. 

In the following sections, we establish an inner bound and an outer bound for the DM-CIC satisfying either (|2]) 
or ([3]). These bounds are valid for the DM-CZIC as well. Later, we will use these bounds to prove the capacity of 
the GCZIC with very strong interference. 

III. Inner and Outer bounds on the capacity of the more capable DM-CIC 

In the first part of this section, we derive an achievable rate region for the DM-CIC. In the second part, we 
introduce a new outer bound on the capacity of the more capable DM-CIC which is valid for DM-CIC with strong 
interference also. Since the more capable DM-CIC is an extension of the more capable DM-BC, the achievable 
region also is an extension of its component's capacity region. The technique used for achievability is the same as 
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capacity achieving technique for the conventional more capable DM-BC. In addition, the outer bound also resembles 
that of the more capable DM-BC. Similarly, achievability, error analysis, and the proof of converse (outer bound) 
follow those of the DM-BC. Nevertheless, in general the inner bound and the outer bound are not equal in the 
more capable DM-CIC while they are proven to be the same for the more capable DM-BC. Indeed, this difference, 
which will be addressed later in this section, prevents establishing capacity region for the more capable DM-CIC. 

A. A new achievable rate region 

Theorem 1 provides an achievable region for the DM-CIC. The achievable technique uses superposition encoding 
at the cognitive transmitter The decoding is based on the joint typicality. 

Theorem 1. An achievable rate region for the DM-CIC consist of all rate pairs (i?i,i?2) that satisfy 

Ri < I{Xi-Yi\U) 
R2<I{U;Y2) 

Ri+R2<I{X,,X2;Y,) (4) 

for some joint distributions that factors as p{u)p{xi)x2{xi,u)p{y2\x2)p{yi\xi, X2), where X2{xi,u) is a function 
which can be random or deterministic. 

The proof uses the superposition coding idea in which Y2 can only decode M2 (the cloud center) while Yi is 
intended to decode the satellite codeword. For completeness we provide the proof in the appendix A. 

B. More capable BC capacity inspired outer bound 

Inspired by capacity of more capable BC p4| , p9| , instead of proving the outer bound for region (|4]), we prove 
it for the slightly altered rate region below. The following outer bound on the capacity holds both for the more 
capable DM-CIC and DM-CIC with strong interference. 

Theorem 2. The union of all rate pairs (i?i,i?2) such that 

R2<I{U;Y2) 
Ri + R2<I{Xi;Yi\U)+I{U;Y2) 

Ri + R2<HXi,X2;Yi) (5) 

for some joint distributions p(m, Xi, X2)j3(yi, ?;2|2;i, 2^2) constitutes an outer bound on the capacity region of a 
DM-CIC satisfying either the more capable condition in Q or strong interference condition in ([3]l. 

The proof is based on the proof of converse for the more capable BC in p4) but adapted for the DM-CIC. For 
completeness, we provide the proof in the appendix B. In the BC, this new form is shown to be an alternative 
representation of the rate region in Theorem [T]|14|; thus, proving the converse for this equivalent region establishes 
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the capacity of the more capable BC. However, these two regions are not equivalent for DM-CIC because of different 
input distributions. Therefore, Theorem |2] provides only an outer bound for the capacity of the more capable DM- 
CIC and DM-CZIC. Nevertheless, later in this paper we show that this outer bound is tight for the GCZIC at very 
strong interference. 

IV. Bounds and capacity of the Gaussian cognitive Z channel at strong interference 

The GCZIC at weak interference (a^ < 1) is a special case of the Gaussian cognitive interference channel (when 
b = 0), for which the capacity region is known for a < 1 and any real b The cognitive user partially devotes its 
power to help send the codeword of the primary user. It dirty paper encodes its own codeword against the codeword 
of the primary user. The cognitive receiver performs dirty paper decoding to extract its message free of interference 
Q and lH}. At strong interference regime (a^ > 1) however, the capacity of the GCZIC is not known in general. 
An outer bound on the capacity of the Gaussian cognitive IC was established by Marie et al. in |4|, Corollary 1. 

In this section, we first find the condition in which the GCZIC is a more capable or under strong interference. 
We show that for the Gaussian CIC, strong interference conditions is equivalent to > 1. Thus for > 1, we 



provide new inner and outer bounds by evaluating the inner and outer bounds in Section III Finally, we prove 
that these inner and outer bounds coincide when the interference is very strong (|a| > ^/l + Pi), thus establish the 
capacity of the GCZIC for this range of interference. 

A. More capable and strong interference conditions for the GCZIC 

In this section we explore the conditions for which Theorem |2] holds for the GCZIC; i.e, we find the condition 
that the GCZIC is either more capable (|2]) or under strong interference ([3]). 

Intuitively, the GCZIC is more capable when interference is very strong, considers the equivalent channel in 
Figure |4] which is achieved by manipulating Figure [3] Since both figures have the same Y2, and Yi is a scaled 
transformation of Yi {Yi — aYi), the channels depicted in these figures are equivalent from capacity point of view. 
The equivalent channel in Figure |4] looks like a broadcast channel if we consider Xi/a as interference. Without 
Xi/a this channel is a degraded BC and its capacity is known. Now, considering the interference Xi/a as noise, 
and assuming that a is large enough that the power associated with noise plus interference {Zi/a + Xi/a) is less 
than noise power at Y2, then Yi can be more capable than 1^2 ■ 

We need to find the range of a for which Yi in Figure [4] (or equivalently Yi in Figure [sj is more capable than 
Y2 in decoding X2- The condition (j2|i is equivalent to I{X2t Xi]Yi) > I{X2]Y2) because of channel transition 
matrix at ([T]i. For the GCZIC, this is equivalent to 

h{Xi +aX2 + Zi)> h{X2 + Z2) (6) 

for all p{xi,X2)- With jointly Gaussian Xi,X2 with correlation factor pi2 then ^ becomes 

Pi + a^P2 + 2api2x/PiP2 + 1 > P2 + 1. 
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Fig. 4. The equivalent cliannel to the GCZIC in Figure [3|, with inputs Xi,X2, outputs Yi = Yi/a,Y2, and effective additive Gaussian 
noises Zi/a and Z2. 

Choosing pi2 = —1, which is the worst case, the GCZIC is more capable if |a| > 1 + \/P\lPi- It is not clear, 
however, if this condition implies more capability. 

Let's now find the range of a for which the strong interference condition Q holds for the GCZIC. The proof 
follows directly from the strong interference condition for the Gaussian cognitive IC p9| , which shows that condition 
(|3]i is equivalent to 

h{aX2 + Zi|Xi) > h{X2 + Z2IX1) 

^o?>\. (7) 

We can draw the conclusion that the strong interference condition ([3]) implies the more capability condition (|2]i for 
any Gaussian cognitive IC. This completes the proof that Theorem [2] is applicable for the Gaussian CIC, and thus 
for the GCZIC, if |a| > 1. 

B. A new outer bound on the capacity of the GCZIC at strong interference 

The capacity of the GCZIC is partially unknown for strong interference (a^ > 1). At this regime, the best outer 
bound on the capacity the GCZIC was established in ff\. Corollary 1. In this section, we provide a new outer bound 
for the capacity of the GCZIC at strong interference. This outer bound is the Gaussian version of the outer bound 
in Theorem|2] with the exti-a inequahty R2 < I{X2;Y2\Xi) ||4) to that. 

Lemma 1. Any achievable rate pair (i?i,i?2) of the GCZIC with a? > 1, is upper bounded by the following 
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Fig. 5. Comparison of new outer bound in Lemma^and the best existing outer bound in |7J, for tlie GCZIC witli Pi = P2 = 6, \a\ = 4. 



constraints 



1 



i?i + i?2 < - log ( 1 + (^(1 - pi)Pi + \a\^{l - pI)P2I 



i?i + i?2 < ^ log (^1 + Pi + a'P2 + 2\a\^pl^PiP2 

i?2< ^l0g(l + (l-p?2)P2) 



where \pi2 - P1P2I < \/ - Pi)(l - pI), and < 1,* = 1,2. 



log 



I + P2 
1 + P2{1-pI) 



(8) 
(9) 
(10) 
(11) 



The proof of this lemma involves showing that the jointly Gaussian distribution is the optimum distribution and 
evaluating the outer bound in Theorem |2] for the GCZIC, then finding the covariance matrix of jointly Gaussian 
Xi , X2 , U to maximize the RHS of all inequalities in Theorem [T] Details of evaluation and maximization can be 
found in Appendix C. 

From the proof, it can be seen that without ( [TT] i, optimality condition is |pi2 — P1P2I = •\/(l — )(1 — P2) which 
is achieved when h{X2\U, Xi) = 0, and implies that X2 is a function of {U,Xi). This condition is the same as 
that in the inner bound. However, the inner bound applies to independent Xi and U whereas the outer bound is 
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for general Xi and U. In Section IV-D we show that for a certain range of a (a^ > 1 + Pi) the optimal Xi and 
U for the outer bound are also independent, thus establish the capacity at that range. 

Figure |5] numerically compares the outer bound proposed in Lemma [T] with the best existing outer bound in p\. 
It shows that new outer bound is strictly better than the existing one. Moreover, as \a\ becomes larger, the gap 
between these two bounds increases, and the outer bound in Lemma [T] gets much closer to the achievable region. 
This lemma establishes the best outer bound on the capacity region of the GCZIC at strong inference (a^ > 1). 
We prove this claim by simplifying the outer bound in Lemma [T] into three simpler outer bounds in the following 
Corollaries. To do so, we first introduce a = I — pf, f3 — I — P2,'^ = Pi2, x ~ 1 — x, and x E [0, 1] to make the 
bounds easier to read. These Corollaries give the capacity of the desired channel for two disjoint ranges of a. 

Corollary 1. Any achievable rate pair R2) of the GCZIC, is upper bounded by the convex hull of the following 
region 

Ri+R2<\ log {1 + P1+ a^P2 + 2|a| v/tPT^) 

i?2 < ^ log (1+7^2) (12) 

where 7 G [0, 1]. 

This follows immediately by removing ([8]), (|9| from the Lemma [T] This is the same as the best existing outer bound 
in |]7|, and our claim that Lemma [T] provides the best outer bound follows readily. It should be highlighted that, 
this region has been recently proven in |l7| and |l8 | to be the capacity of the GCZIC for 



1< a < ,/l ' ^1 



1+P2 ■ 

Corollary 2. Any achievable rate pair R2) of the GCZIC, is upper bounded by the convex hull of the following 
region 

1 , / 01P2 
i?2 < - log IH — 

Ri+R2<\ log (1 + ( 7^ + |a| v^)') + ^ log (1 + ^ "^'^^ ) (13) 



where a G [0, 1]. 



Similar to the Corollary [T] we first eliminate ( [TO| l, ( [TT| ) in the Lemma [T| Then, the proof of is immediate by looking 
at (|9]l, and is achieved for pi = (/3 = 1). As we will show later in Section IV-D Corollary |2] is also the capacity 
region for the GCZIC when interference gain is considerably large (|a| > \/PiP2 + \/l + Pi + PiP2)- 

Corollary 3. Any achievable rate pair R2) of the GCZIC, is upper bounded by the convex hull of the following 
region 

i?2 < ^ log ( 1 



2 ° V 1 + aPa 
Ri+R2< hogfl + (y^+|a|v^)') +^logfl 



2 ° V ^ V , . , , V J 2 °\ l + aP2 

i?i + i?2 < ^ log (1 + Pi + a^P2 + 2|a|(V^ + ya^)/P^) (14) 
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where a, l3 ^ [0, 1]. 

To prove this, we first remove (H) in the Lemma [T| Then, in ( [TO] i we use api2 < |a||pi2| < |a|(|piP2| + 
•\/(l — Pi)(l — P2)) where the last inequality follows applying triangle inequality — y|>|a;| — |y| with x ~ pi2 
and y = pip2 to the LHS of \pi2 — /O1P2I < ■\/(l — )(1 — /92)- The maximum is attained when /91/92 has the 
same sign with a. Finally, the outer bound in Corollary [sjis obtained considering that a ^ \ — p^, j3 ^ 1 — p\. 
Similar to Corollary |2] in Section IV-D we will show that for (i — \, Corollary |3] results in the capacity region of 



the GCZIC when \a\> yj\ + Pi. 



C. Superposition coding-based inner bound for the GCZIC 



In this section, we compute the Gaussian version of the achievable region introduced in Section III-A for the 
GCZIC. Following lemma, which extends Theorem[T]to the GCZIC, provides the achievable region by superposition 
coding. 

Lemma 2. Any rate pair {Rx^K^) satisfying 

Ri < ^log(l + (/f^ + ayai^)2 

1 , / aP^ 
i?2 < 7^ log 1 



2 ° V 1 + a-P2 

Ri+R2<l log (1 + Pi + a'P2 + 2aVaPiP2) (15) 
with a € [0, 1] is achievable for the GCZIC. 

Proof: The achievability of this region is straightforward by Theorem [T] In the proof of Lemma [T] we have 
shown that jointly Gaussian input is the optimum input to maximize I{Xi; Yi\U),I{U; I2), and I{Xi,X2; Yi). The 
cognitive user partially uses its power to help send the codewords of the primary user. X2 contains two independent 
Gaussian parts, X2 ~ \/a]^V{ini) + y/dHyj {171,2) ■ The primary user dedicates its whole power to transmit mi, as 
Xi = \/PiV (mi) . Decoding in the primary receiver is based on successive cancelation, where it first decode and 
subtract the cognitive user's codeword in order to decode its own codeword free of interference. Cognitive receiver 
simply decodes its own codeword assuming the other codeword as interference. ■ 



This achievable region even simplifies when the interference gain is substantially large, that is \a\ > \JP\P2 + 
\/\ + Pi + P1P2. In the following section we show that, when interference is very strong (|a| > \/\ + Pi), the 
inner bound in Lemma |2] gives the capacity of the desired channel. 

D. Capacity of the GCZIC at very strong interference 

In this section, we prove that superposition coding achieves the capacity of the GCZIC for \a\ > \/l + Pi. 
Specifically, we show that in Corollary |3] /3 has to be 1 for this range of a. Hence, the outer bound coincides with 
the inner bound in Lemma |2] and gives the capacity of the desired channel. 
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Theorem 3. The capacity region the GCZIC for \a\ > \/l + Pi is the set of all rate pairs (i?i,i?2) satisfying 

^^^'M'^ifk) '''' 

Ri+R2< I log (l + (Ti^ + |a| + I log (l + ^ "^'^^ ) (17) 
i?i + i?2 < ^ log (l + Pi + a^Pa + 2|a| VaPiPa) (18) 

/or a e [0, 1]. 

Proof We want to show that for \a\ > \/l + Pi the outer bound in Corollary |3] and the inner bound in 
Lemma [2] coincide. Let's define TZi as the set of all rate pairs satisfying the constraints in Lemma [2] and TZo as the 
set of all rate pairs satisfying ([T6]l-([T8]l. Using the same argument as El Gamal p4) , we can show that TZi = T^o- 
Here R2 can be thought of as the rate of common message which can be decoded at both receivers while Pi is 
the rate of the private message. Now {R2, Ri) G TZo if and only if {R2 — t, Ri + t) G TZo for any < i < Pa- 
This means the common rate (P2) can be partially or entirely private. Thus region TZo can be represented as TZi. 

We next show that the outer bound in Corollary [3] simplifies to TZo- To do so, it suffices to show that for 
|a| > \/l + Pi, 13 has to be 1 in Corollary [s] Consider the first two inequalities of the outer bound in Corollary [s] 
we can see that on the boundary of this outer bound we must have 

R2<llog(l + {^pi\ + \a\V^2f) (19) 



2 

Comparing this inequality with the first inequity in TZi, we conclude that either /? has to be 1 or the first inequity 
of TZi must be loose; since otherwise the outer bound is less that the inner bound, which is not possible. For this 
inequality to be redundant in TZi we need 

i log (1 + Pi + a^P2 + 2|a| VaPiPa) < ^ log (^1 + ^ "^'^J + ^ log (l + (v^ + |a| v^)^) 
^ - log 1 + ^ , ^ ^ , , < - log 1 



2 °V l + (/f^+|a| 7^)2 7 2 "V l + aP2. 

^ \a\ < ^JaPiP2 + v/1 + Pi + aPiP2 (20) 

For ( |20] i to hold with any a then 

\a\ < Vl + Pi- (21) 

This implies the first inequality of Lemma [2] is redundant only for |a| < ^/T^f7l. In other words, if |a| > vT+^Pl^ 
there exist some a for which this inequality cannot be redundant; this in turn enforces (H — 1. Thus, for this range 
of a, the outer bound in Corollary |3] simplifies to TZo- Note also that with = 1 (pi = 0) the optimal input for the 
outer bound is the same as the input for the inner bound (i.e., Xi, U are independent and X2 = X2{U,Xi), thus 
the capacity is established. 

■ 

As a special case, when the third constraint is redundant in TZi and TZo, the outer bound in Corollary |2] is tight. 
For such an a, the capacity region in Theorem [5] further simplifies as below. 
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Corollary 4. The capacity region the GCZIC for \a\ > P1P2 + \/l + -Pi + -Pi/2 is the set of all rate pairs 
(Pi,i?2) satisfying 



1 



Ri < - log l + (VPi + |a|VaPs 



2 

P2 < ^ log f 1 + ) (22) 

^ - 2 ^ V 1 + aP2 / 

/or a e [0, 1]. 

Proof: First consider the achievable rate region in Lemma |2] The third inequaUty becomes redundant if 



2) 



^2 



2 ° V ' ' ^ " V - 2 ° V 1 + aP, y 2 

1 / a^aPa 
=^ a log 1 



l + (\/Pr+|a|%/^)2 





1 


log( 






«P2 \ 


2 


1 


+ aP2/ 




1 


log ( 






aP2 \ 


2 


1 


+ aP2/ 



log l + (v/Pi + |a|v/aP; 



2 



^ \a\ > VaPiP2 + V'^ + Pi + aPiP2 (23) 

For (|23]l to hold with any a then 



\a\ > VPvP^ + ^l + Pi+PiP2. (24) 

On the other hand, without the third inequality, the achievable region in Corollary |4] is also equal to the outer bound 
in the Corollary |2] This is because, for the same value of a, any point on the boundary of this region is on the 
boundary of the region in Corollary |2] and vice versa. Hence, we obtain the capacity region in Corollary |4] if (|24]) 
holds for any a, i.e., \a\ > + VTTT\n\K. ■ 

Corollary |4]provides a special case of Theorem |3] with simpler rate expressions. In the concurrent and independent 
work | [20) , "Theorem V.3. Capacity for S-G-CIFC" archives the same capacity result in Corollary |4] using a different 
approach. The achievability follows from a more general DPC-based scheme for the Gaussian cognitive interference 
channel. Although the achievability seems to be based on DPC, a close observation reveals that DPC is not necessary 
to achieve the capacity. This is because the parameter of DPC is zero (A = 0) which means that, in effect, DPC 
has no contribution; thus, the achievability scheme reduces to superposition coding. The outer bound is completely 
different and is based on the MIMO-BC outer bound pO| . 

Theorem[3]shows that, when the interference is very strong, the interfered primary receiver can decode the message 
of the interfering cognitive transmitter in a rate higher than its own receiver This is as though the interference 
link from the cognitive transmitter to the primary receiver is less noisy than the direct link for the cognitive user's 
message. This sheds light on the optimal coding scheme as well. That is, the primary user encodes independently 
while the cognitive user encodes by superimposing the primary user's codeword on its own. Then, the cognitive 
receiver decodes its message treating primary user's codeword as noise. The primary receiver, on the other hand, 
performs successive cancellation; it first decodes the cognitive user's message, then subtracts it from the received 
signal to decode its own message free of interference. 
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TABLE I 

Summary of all capacity results for the GCZIC 



Range of a 


Capacity region 


Capacity acliieving 
technique 


Reference 


\a\ < 1 


/?i < i log (1 + 
^2 < 1 lo 


{yPr+|a|v/SP5)2 \ 

g{l + aP2) 


superposition coding 
and DPC 


M 


i<kl<^i + x|^ 


Ri+R2<^ log (1 + P1+ a^P2 + 2|a|VoPift) 
R2 < |log(l + aP2) 


superposition coding 
and DPC 


17 , 


isj 


\a\ <y/l + Pl 


(Lemma 1 


unknown 
gives the best outer bound) 


unknown 






Rl < 5 log (1 + ( 


v^+|a|v^)2) 


superposition coding 


Theorem jsj 


\a\ >y/l + Pl 


Ri+R2<^ log 


2 iog 
(l + P 


^ I + QP2 J 

i+a^P2 + 2\a\VaPiP2) 


\a\ > VP1P2 + V'i- + Pi+ P1P2 


iJi < 1 log (1 + ( 
R2 < 1 log 


v^+|a|vOT^)2) 


superposition coding 


Corollar 
20 


4 



V. Summary 

Analysis of the capacity results of the GCZIC shows that superposition coding appears to be an indispensable 
tool in any capacity-achieving techniques for this channel. At very strong interference, superposition coding single- 
handedly achieves its capacity. However, both DPC and superposition coding are needed to establish the capacity 
when interference is weak or intermediate. Table I summarizes the capacity results for the GCZIC. As it can be 
seen, up to now, the capacity of this channel is characterized except for y^l + < a < \/l + Pi. For this range 

of a, a more general and inclusive form of the proposed capacity-achieving outer bounds at \a\ > 1, as represented 
in Lemma [T] provides the best outer bound on the capacity region of the GCZIC. 
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VI. Appendix 

A. Proof of Theorem [7] 

Proof: We prove this theorem by showing the encoding, decoding, and error analysis. 

1) Code construction and encoding: Fix p{xi), p{u) and p{x2\xi,u) that achieve capacity. Randomly and 
independently generate 2"-'^^ sequences x"(mi), mi e [1 : 2"-'^^] i.i.d. according to n"=iP(^ii)- Also, randomly 
and independently generate 2"^^ sequences M"(m2), m2 G [1 : 2"^^] with elements i.i.d. according to nr=i Pui^i). 
Next, for each pair of sequences (u" (7712), (mi)), randomly and conditionally independently generate one 
sequence X2{mi,m2) with elements i.i.d. according to YYi=iPx2\Xiu{x2i\xii{'mi)ui{m2)). 

For encoding, to send messages (mi,m2), the primary transmitter just sends the codeword ^"(mi) and the 
secondary transmitter sends the codeword X2{mi,m2) corresponding to those messages. 

2) Decoding: Decoding is based on standard joint typicality. The less capable receiver (1^2) can only distinguish 
the auxiliary random variable U. Decoder 2 declares that message m2 is sent if it is a unique message such that 
(u"(m2), 2/2) G T^^""* ; otherwise it declares an error Decoder 1 declares that message mi is sent if it is a unique 
message such that (u"(m2), (mi), a::2 (m-i, m2), 2/2 ) ^ 7^*'"'' for some m2; otherwise it declares an error 

3) Error analysis: To analyze the probability of error, without loss of generality, assume that {Mi, M2) — (1, 1) 
is sent. Fkst we consider the average probability of error for decoder 2. Let's define the error events 

f2i = (c/"(i),r2")^r/") 

£22 = (t/"(m2), rj') e for some m2 ^ 1 (25) 
By union bound, the probability of error for decoder 2 is upper bounded by 

£2 = P{£21 U £22) < P{£2l) + P{£22)- (26) 

Now by law of large numbers (LLN) P(£2i) — ^ as n ^ 00. Also, since C/"(m2) is independent of (J7"(l),y2") 
for ma 7^ 1 by the packing lemma |[l9j P{£22) ^ as n ^ 00 if i?2 < I{U] Y2) - (5(e). 

Then, consider the average probability of error for decoder 1. We define the following error events 

f 11 =(Xi"(i), [/"(!), X2"(i, 1), Yi") i r,(") 
£12 =(Xi"(mi),;7"(i),x«(mi,i),ri") e ri") 

for some mi 7^ 1 
fi3 =(Xi"(mi),[/"(m2),X2"(mi,m2),ri") G T.^") 

for some mi 7^ l,m2 7^ 1. (27) 
Using union bound, the probability of error for decoder 2 is upper bounded by 

£2 = P{£ii u £12 u £13) < Pi£ii) + P{£i2) + P{£i3) (28) 

Now we evaluate each term in the right-hand side (RHS) of this inequality when n — ^ 00. First consider £11; again 
by LLN P{£ii) — > as n — > 00. Next consider £12. For mi 7^ 1 since (Xi(mi), X2(mi, 1)) is conditionally 
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independent of given ?7"(1), by packing lemma P{£i2) — > as n — > oo given i?i < I{Xi, X2;Yi\U) — 
d{e) = I{Xi;Yi\U) — (5(e) because X2 is a function of {U,Xi). Finally consider ^13. For nii 7^ 1 and m2 1, 
(Xi(mi), t/"(m2), X2(mi, m2)) is independent of Y"; hence, by packing lemma P{£i3) — > as n — > 00 if _Ri + 
R2 < IiXi,U,X2;Yi) ~ (5(e) = I{Xi,X2]Yi) - (5(e). The equality follows because U Xi,X2 ^ Fi forms 
a Markov chain. The above analysis completes the proof of achievability since it shows that both receivers can 
decode corresponding messages with the total probability of error tending to zero if Q is satisfied. Therefore, there 
exists a sequence of codes with error probability tending to 0. ■ 

B. Proof of Theorem |2] 

1) For more capable DM-CIC: The proof is also similar to the converse proof for the more capable broadcast 



channel p4) . We follow the same line of proof as in 1 19|; the only difference is replacing Xi in ]T9) with {Xi, X2), 
since here X2 also encodes Mi. 

We can bound the rates R2 and R2 + R2 as 

nR2 = H{M2) 

= I(M2-,Y^) + H(M2\Y^) 

<I{M2:Y^)+nei^ (29) 

and 

n{Ri+R2)^H{Mi,M2) 

^ H{Mi\M2) + H{M2) 

= I{A'h;Y^'\M2) + H{Mi\Y{',M2)+I{M2;Y2n+H{M2\Y2n 
< I{M^;Y^\M2) + I{M2; Y^) + ne2„ (30) 
where (|29]l and pO| follow by Fano's inequality. In a very similar fashion, sum rate can be also bounded by 

n{Ri + R2) < I{M2;Yi'\Kh) + /(Mi; Fi") + ne3„. (31) 

Now we manipulate the RHS of (|29|-([3TJ to obtain the desired terms in (|5]l. First, consider the mutual information 
term in (|29| 

n 

/(M2; ^2") = ^(^'^2; Y2,\Yi],^^) (32) 

Z^l 

n 

<^/(M2,r2"+i;>'2.) 

71 

<J2l{M2,Yr\Y2%,;Y2.) 

71 

^J2l{U,;Y2^) (33) 
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in which ( 32 1 follows from the chain rule, and we have defined the auxiUary random variable Ui = {Al2 ^ , ^2"!+! ) 



moving to (33i. Next, we bound the mutual information terms of the second inequality in pO| ). 

n n 

I{Nh]Y^\M2) + /(M2; Y^) = I{Mi;Yu\M2, Yr') + ^ /(Afa; Fs.lK," +1) 



< J2 HM^,Y^^,_,,;Yu\M2,Yr') + J2 ^(^2, l^^+i; F^.) 

n 

n n 

+ J2 I{M2,Y-,^„Yr';Y2.) I{Yr';YM,Y2%,) 

n n 

= Y I{M,;Yu\M2, F2" +1) + E ^(^2, r^i; 

n n 

i—l i 

n n 

= E I{M,;Y,,\U,) + E I{U.; Y2i) 

n n 

< J2 I{Xu; Yu\U,) + E I{U^;Y2^) 



i=l 



(34) 
(35) 



where (34i follows by the Csiszar sum identity and the auxiliary random variable Ui = {M2,Yl ^j^2"i+i)' '35 1 



follows by Markov chain Mi i^iU) Yi. 

For the third inequality, following steps similar to the bound for the second inequality and defining Vi 

as 



(Ml, Yj* ;^2"i+i) '^^'1 bound the mutual information terms in (31 



/(M2; r2"|Mi) + /(Mi; Yi") = E I{M2;Y2,\Mi,Yr') + ^ HMi;Y2^\Y^-') 

n n 
<YIi^■^^'Y2^\Vi)+Y,IiV^■^Y2^) (36) 
i=l 1=1 
n n 

< E HXu, X2^;Y2,\V{) + E I{y^■^Yli) (37) 

1=1 i=l 
n n 

< J2 HXu, X2^■,Yu\Vi) + J2 IiV^■,Yu) (38) 

n 

— E/ /("'^li' "'^Zi; Yii) 



1=1 



in which (36 1 follows similar steps to the bound for the first inequality on sum rate; (37i follows from Af2 



Xi, ^2 ^ Y2; and ^ follows by ^ that gives /(Xi, X2; Yi) > /(Xi, X2; ^2), and implies that /(Xi, X2; Yaly) < 
/(Xi,X2;Yi|F). 

Next, we define the time sharing random variable Q which is uniformly distributed over [1 : n] and is independent 
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Of (Mi,M2,Xi",X2",ri",y2"). Also we define U = {Q,Uq),X^ = X^^.X^i = XJ^^.Y^^ = Y^^.Y^ = Y^q. 
Then we have 



Similarly 



^i?2 <^I{U^■,Y2^)+ntln 
i=l 

= nI{U;Y2\Q)+nein 
< nIiU;Y2) +nein 



and 



i=l i=l 

= nI{Xi;Yi\U,Q) + nIiU;Y2\Q) + ne2n 
= nI{Xi ;Yi\U) + nI{U; ^2 IQ) + ^^e2n 
< n/(Xi ;Yi\U) + nI{U; Y2) + ?ie2„ 

n 

n{Ri +R2)<Y^ I{Xu, X2^■, Yu) + nesn- 

i=l 

= nI{Xi,Xi;Yi\Q)+ne3n 
< nI{Xi,Xi;Yi)+ne3r, 

As n 00, ei„,e2n, and e3„ tend to zero because the probability of error is assumed to vanish. This completes 
the proof for the more capable DM-CIC. 

2) For DM-CIC with strong interference: Proof: The proof outer bound under the strong interference 
condition ([3| is almost the same, with only slight difference in the proof of last inequality. This is because the 
first two inequalities hold for any DM-CIC. Under the strong interference condition ([5]) we can we can bound the 
mutual information terms in pTj i as 

/(M2; l^l^^i) + ^(Mi; Fi") < /(M2; ig'iMi, + /(Ml, Xi"; Y^) 

< I{X'2';Y2^\Mi,X^) + /(Ml, Xi"; Yi") 

< /(X^'; Yi"|Mi, Xi") + /(Ml, Xi"; Fi") (39) 
= I{Xl\X^;Yn 

n 

< liXii, X2i] Yii) 
i=l 

in which ^ follows by ^ that gives /(X2;r2|^i) < /(X2;Yi|Xi), and implies that /(X2; ^2!^!, Mi) < 



/(X2; Fl|Xi, Ml). The other steps are straightforward. Finally, this proves that Theorem |2] holds both for more 
capable strong and interference DM-CIC. ■ 
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C. Proof of Lemma |7] 

Proof We need to find the distribution that maximize the rate region in Theorem |2] for the Gaussian channel. 
In what follows, we show that jointly Gaussian Xi,X2,U is optimum, i.e., it provides the largest outer bound for 
the Gaussian channel. By maximum entropy theorem, the RHS of the third inequality is maximized when Yi is 
Gaussian, thus Yi — Xi + aX2 + Z\ is Gaussian; that is X\ + aX^ must be Gaussian. 
Similarly, 

= h[Y2)-h{Y2\U) 
<h{Y^)~h{Y2\U) 

< Ky?) - \ log (22''(^^l^) + 22'*(^^)) (40) 

where Y2 denotes Y2 when the inputs X\ , X2 are Gaussian. The last inequahty follows by conditional version 
of entropy power inequality (EPI) for which equality is achieved when X2\U is Gaussian . Likewise, for the term 
l{Xx\Y]\U) we can write 

I(Xr,Y^\\J)^h{Yr\V)-h{Y^\V,X^) 

< h{X^\U'^ + aX^\U'^ + Zi) - h{Yi\U,Xi) 

< |C/^ + aX^p"" + Zi) - ^ log (2^HaX,\u,x,) ^ ^2h(z,)^ 

where where X^ ,U'^ denoteXi,f7 when the inputs Xi,X2 are Gaussian, and inequalities follow by maximum 
entropy theorem and conditional version of EPI, respectively. Again equality is achieved when all terms are Gaussian. 
Hence, all inequalities in the outer bound are maximized with jointly Gaussian Xi, X2, U . 

Now the problem is to find the optimum covariance matrix to maximize the bounds, i.e., to determine correlation 
coefficients among Xi,X2, and U. Let {U,Xi,X2) ^ JV{0,K), which are correlated Gaussian random variables 
Xi,X2, and U with covariance matrix 

f Pu PiVlW P2VWP2 \ 

PlVWPl Pi P12^/T\P2 



K = COV{U,Xi,X2 



(41) 



\ P2^TW2 P12^IW2 P2 ) 

Since the covariance matrix is positive semidefinite, the determinant of this matrix must be nonnegative. That is 

1 - P?2 - + 2piP2Pi2 - P2 > 0. (42) 

The inequality holds if 

piP2 - ^{i-pI){i-pD < pi2 < P1P2 + ^{i- pI){i- pI) 

or equivalently the covariance matrix is positive semidefinite if 

\Pi2 - PiP2\ < ^J{1-pI){1-pI). (43) 
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Now we evaluate the rate constraints defining the outer bound in Theorem |2] 

R2<IiU;Y2) 

= h{Y2) + h{U) - h{Y2,U) 

= h{X2 + Z2) + h[U) - h{X2 + Z2,U) 

= ^log2^e(l + P2) + ^log2^e (Pc/) - ^ log (2^e)2|ifi| 



1 



log 



I + P2 



1+P2(1-P^) 



where 



Similarly 



Ki = 



1 + P2 P2VP1JP2 
P2VPUP2 Pu 



I{Xi-Y^\U) ^ h{Y^\U) - h{Y^\U,X^) 

= h{Y^, U) - h{U) - \ log {2^KaX,\v.x,) ^ ^2h(z,) 

<h{Y^,U)-h{U)-\\og (22''(^^)) 

^\\og{\K2\/Pu) 



where 



\K2 



1 + Pi+a^P2 + 2api2VPJ^ PiVPuT\ + ap2VPuP^ 
piVPuPi + api2VPuP2 Pu 

Pu{l + Pi+ a^P2 + 2api2yP^ - pfPi - a^plP2 - 2apip2^/PD^2) 



To check if the inequality (45i can hold with equality, we evaluate the term 2^'*(°"^2|;7,J>s:i) 
h{X2\U,Xi)^h{U,Xi,X2)-h{U,Xi) 

= ^ log {{27:ef\K\) ~ ^ log {{27: e)' Pu Pi (l - pD) 



1 



log 27reP2 



1 - P12 - Pi - P2 + 2pi2P2Pi 



(44) 



(45) 



(46) 



(47) 



in which the covariance matrix K is defined by (41 1. Since both numerator and denominator are nonnegative in 
(45 I, the argument of this function is either zero or positive. Therefore, min 

^22hiaX2\u.x^)^ = is achieved when 

1- PI2- pj- pl + 2pi2P2Pi = 0, or equivalently, |pi2-piP2| = V(l - - pD- Note that /i(aX2|[/, Xi) = 
implies X2 to be a function of {U,Xi). 



Keeping this in mind that \pi2 — P1P2I = ^(1 — — P2) is optimum condition for (45 1, we evaluate 
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I{Xi_]Yi\U) as follows. 



I{Xi;Y,\U) < hogi\K2\/Pu) 



I log f 1 + (1 - pI)Pi + a\l - pI)P2 + 2a(pi2 - pm)^^ 



< ^ log (l + (1 - pI)Pi + a^l - pj)P2 + 2\a\\pi2 ~ PiP2\^I\P2 



< ^ log ( 1 + (1 - pI)P, + a^{l - pI)P2 + 2|a|^/(l - - p^P.P^ 



where, the last inequahty follows from (43 1. Interestingly, again 

\pi2 - P1P2 1 = ^{i-pIKi-pD (48) 

turns out to be the optimum condition. From this two values for pi2 are plausible which are respectively 

p[\^ = PiP2 + \J {I - pI){1 - pD (49) 



P?^ ^PiP2-^{1-pI){1-pI) (50) 
Then, it is also straightforward to calculate the third bound in Theorem |2] to obtain 

Ri+R2<l: log f 1 + Pi + a^P2 + 2api2^PvP2' 



< i log (l + Pi + a^P2 + 2 m&x{ap[^^ ,ap^^^^}^/P^2 
Therefore, from the outer bound in Theorem |2j we compute the rate region TZq with following constraints 

R2 < ^ log 



2 Vl + P2(l-Pi) 

Pi+i?2 < I log (^1 + (1 - pI)Pi + a'(l - pI)P2 + 2\a\^{l - pl){l - pl)P^P2 

.1, / I + P2 
log 



2 ^\l + P2{l~pl) 
Ri+R2<\ log (1 + Pi + a^P2 + 2 max{a/9^2\ apfa^l/Pi^ 

As a last step, we can evaluate and add the Gaussian version of the standard inequality R2 < I{X2;Y2\Xi) 
Q, to these bounds; the corresponding inequality is 

P2< ^l0g(l + (l-pL)^2) (51) 

As a result, the outer bound is as given in Lemma [T] ■ 
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